Search CORE

106 research outputs found

MicrO: an ontology of phenotypic and metabolic characters, assays, and culture media found in prokaryotic taxonomic descriptions

Author: Carrine E. Blank
Hong Cui
Lisa R. Moore
Ramona L. Walls
Publication venue: Springer Nature
Publication date: 01/01/2016
Field of study

Background: MicrO is an ontology of microbiological terms, including prokaryotic qualities and processes, material entities (such as cell components), chemical entities (such as microbiological culture media and medium ingredients), and assays. The ontology was built to support the ongoing development of a natural language processing algorithm, MicroPIE (or, Microbial Phenomics Information Extractor). During the MicroPIE design process, we realized there was a need for a prokaryotic ontology which would capture the evolutionary diversity of phenotypes and metabolic processes across the tree of life, capture the diversity of synonyms and information contained in the taxonomic literature, and relate microbiological entities and processes to terms in a large number of other ontologies, most particularly the Gene Ontology (GO), the Phenotypic Quality Ontology (PATO), and the Chemical Entities of Biological Interest (ChEBI). We thus constructed MicrO to be rich in logical axioms and synonyms gathered from the taxonomic literature. Results: MicrO currently has similar to 14550 classes (similar to 2550 of which are new, the remainder being microbiologically-relevant classes imported from other ontologies), connected by similar to 24,130 logical axioms (5,446 of which are new), and is available at (http://purl.obolibrary.org/obo/MicrO.owl) and on the project website at https://github.com/carrineblank/MicrO. MicrO has been integrated into the OBO Foundry Library (http://www.obofoundry.org/ontology/micro.html), so that other ontologies can borrow and re-use classes. Term requests and user feedback can be made using MicrO's Issue Tracker in GitHub. We designed MicrO such that it can support the ongoing and future development of algorithms that can leverage the controlled vocabulary and logical inference power provided by the ontology. Conclusions: By connecting microbial classes with large numbers of chemical entities, material entities, biological processes, molecular functions, and qualities using a dense array of logical axioms, we intend MicrO to be a powerful new tool to increase the computing power of bioinformatics tools such as the automated text mining of prokaryotic taxonomic descriptions using natural language processing. We also intend MicrO to support the development of new bioinformatics tools that aim to develop new connections between microbial phenotypes and genotypes (i.e., the gene content in genomes). Future ontology development will include incorporation of pathogenic phenotypes and prokaryotic habitats.This work was funded by grants from the National Science Foundation (award DEB-1208534 to CEB, DEB-1208567 to HC, and DEB-1208685 to LRM) and by a travel grant (to CEB) to attend the 2013 NESCent Ontologies for Evolutionary Biology workshop. RW was supported by CyVerse and the National Science Foundation under award numbers DBI-0735191 and DBI-1265383. Many thanks to Elvis Hsin-Hui Wu (University of Arizona), Gail Gasparich (Towson University), and Gordon Burleigh (University of Florida) for comments and/or assistance with ontology construction and compilation of taxonomic descriptions. We would also like to thank Chris Mungall (LBNL), Oliver He (University of Michigan) for technical assistance with OntoBee and OntoFox, and Gareth Owen (ChEBI project leader, head curator) and other curators at ChEBI for assistance in the incorporation of microbial-specific chemical terms and synonyms into ChEBI. Thanks also to the instructors (Melissa Haendel, Matt Yoder, Jim Balhoff) and students of the 2013 NESCent Ontologies for Evolutionary Biology workshop, and to Karen Cranston (NESCent) and the support staff at NESCent. Thanks also to the OBI-devel team for comments regarding the overall structure of assay terms, and associated object properties, in MicrO.This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]

Springer - Publisher Connector

PubMed Central

The University of Arizona

The Plant Ontology: A common reference ontology for plants

Author: Chris Mungall
Cooper Laurel D.
Gandolfo Maria A.
Justin Elser
Pankaj Jaiswal
Smith Barry
Stevenson Dennis W.
Walls Ramona L.
Publication venue
Publication date: 01/01/2010
Field of study

The Plant Ontology (PO) (http://www.plantontology.org) (Jaiswal et al., 2005; Avraham et al., 2008) was designed to facilitate cross-database querying and to foster consistent use of plant-specific terminology in annotation. As new data are generated from the ever-expanding list of plant genome projects, the need for a consistent, cross-taxon vocabulary has grown. To meet this need, the PO is being expanded to represent all plants. This is the first ontology designed to encompass anatomical structures as well as growth and developmental stages across such a broad taxonomic range. While other ontologies such as the Gene Ontology (GO) (The Gene Ontology Consortium, 2010) or Cell Type Ontology (CL) (Bard et al., 2005) cover all living organisms, they are confined to structures at the cellular level and below. The diversity of growth forms and life histories within plants presents a challenge, but also provides unique opportunities to study developmental and evolutionary homology across organisms

PhilPapers

Community next steps for making globally unique identifiers work for biocollections data

Author: Agosti Donat
Catapano Terry
Cellinese Nico
Deck John
Guralnick Robert P.
Hagedorn Gregor
Kunze John
Page Roderic D.
Penev Lyubomir
Pyle Richard L.
Walls Ramona
Wieczorek John
Publication venue: Pensoft Publishers
Publication date: 01/01/2015
Field of study

Biodiversity data is being digitized and made available online at a rapidly increasing rate but current practices typically do not preserve linkages between these data, which impedes interoperation, provenance tracking, and assembly of larger datasets. For data associated with biocollections, the biodiversity community has long recognized that an essential part of establishing and preserving linkages is to apply globally unique identifiers at the point when data are generated in the field and to persist these identifiers downstream, but this is seldom implemented in practice. There has neither been coalescence towards one single identifier solution (as in some other domains), nor even a set of recommended best practices and standards to support multiple identifier schemes sharing consistent responses. In order to further progress towards a broader community consensus, a group of biocollections and informatics experts assembled in Stockholm in October 2014 to discuss community next steps to overcome current roadblocks. The workshop participants divided into four groups focusing on: identifier practice in current field biocollections; identifier application for legacy biocollections; identifiers as applied to biodiversity data records as they are published and made available in semantically marked-up publications; and cross-cutting identifier solutions that bridge across these domains. The main outcome was consensus on key issues, including recognition of differences between legacy and new biocollections processes, the need for identifier metadata profiles that can report information on identifier persistence missions, and the unambiguous indication of the type of object associated with the identifier. Current identifier characteristics are also summarized, and an overview of available schemes and practices is provided

Biodiversity Heritage Library OAI Repository

ZENODO

Directory of Open Access Journals

PubMed Central

Enlighten

ARPHA OAI-PMH Endpoint

ARPHA Preprints

The environment ontology in 2016: bridging domains with increased scope, semantic density, and interoperation

Author: Buttigieg Pier Luigi
Lewis Suzanna E.
Mungall Christopher J.
Pafilis Evangelos
Schildhauer Mark P.
Walls Ramona L.
Publication venue
Publication date: 01/01/2016
Field of study

Background The Environment Ontology (ENVO; http://www.environmentontology.org/), first described in 2013, is a resource and research target for the semantically controlled description of environmental entities. The ontology's initial aim was the representation of the biomes, environmental features, and environmental materials pertinent to genomic and microbiome-related investigations. However, the need for environmental semantics is common to a multitude of fields, and ENVO's use has steadily grown since its initial description. We have thus expanded, enhanced, and generalised the ontology to support its increasingly diverse applications. Methods We have updated our development suite to promote expressivity, consistency, and speed: we now develop ENVO in the Web Ontology Language (OWL) and employ templating methods to accelerate class creation. We have also taken steps to better align ENVO with the Open Biological and Biomedical Ontologies (OBO) Foundry principles and interoperate with existing OBO ontologies. Further, we applied text-mining approaches to extract habitat information from the Encyclopedia of Life and automatically create experimental habitat classes within ENVO. Results Relative to its state in 2013, ENVO's content, scope, and implementation have been enhanced and much of its existing content revised for improved semantic representation. ENVO now offers representations of habitats, environmental processes, anthropogenic environments, and entities relevant to environmental health initiatives and the global Sustainable Development Agenda for 2030. Several branches of ENVO have been used to incubate and seed new ontologies in previously unrepresented domains such as food and agronomy. The current release version of the ontology, in OWL format, is available at http://purl.obolibrary.org/obo/envo.owl. Conclusions ENVO has been shaped into an ontology which bridges multiple domains including biomedicine, natural and anthropogenic ecology, ‘omics, and socioeconomic development. Through continued interactions with our users and partners, particularly those performing data archiving and sythesis, we anticipate that ENVO’s growth will accelerate in 2017. As always, we invite further contributions and collaboration to advance the semantic representation of the environment, ranging from geographic features and environmental materials, across habitats and ecosystems, to everyday objects in household settings

ZENODO

Springer - Publisher Connector

PubMed Central

Electronic Publication Information Center

eScholarship - University of California

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

MPG.PuRe

Recommended from our members

Transforming the study of organisms: Phenomic data models and knowledge bases

Author: Balhoff James P.
Buttigieg Pier Luigi
Haendel Melissa A.
McGuinness Deborah L.
Mungall Christopher J.
Singer Jessica
Stucky Brian J.
Thessen Anne E.
Vogt Lars
Walls Ramona L.
Warren Robert
Yoder Matthew J.
Publication venue: San Francisco, Calif. : Public Library of Science
Publication date: 01/01/2020
Field of study

The rapidly decreasing cost of gene sequencing has resulted in a deluge of genomic data from across the tree of life; however, outside a few model organism databases, genomic data are limited in their scientific impact because they are not accompanied by computable phenomic data. The majority of phenomic data are contained in countless small, heterogeneous phenotypic data sets that are very difficult or impossible to integrate at scale because of variable formats, lack of digitization, and linguistic problems. One powerful solution is to represent phenotypic data using data models with precise, computable semantics, but adoption of semantic standards for representing phenotypic data has been slow, especially in biodiversity and ecology. Some phenotypic and trait data are available in a semantic language from knowledge bases, but these are often not interoperable. In this review, we will compare and contrast existing ontology and data models, focusing on nonhuman phenotypes and traits. We discuss barriers to integration of phenotypic data and make recommendations for developing an operationally useful, semantically interoperable phenotypic data ecosystem

Repositorium für Naturwissenschaften und Technik

An ontology approach to comparative phenomics in plants

Author: Cannon Ethalinda K. S.
Cannon Steven B.
Cooper Laurel
Gardiner Jack
Gkoutos Georgios V.
Harper Lisa
He Mingze
Hoehndorf Robert
Huala Eva
Jaiswal Pankaj
Kalberer Scott R.
Lawrence Carolyn J.
Lloyd John P.
Meinke David
Menda Naama
Moore Laura
Nelson Rex T.
Oellrich Anika
Pujar Anuradha
Walls Ramona L.
Publication venue
Publication date: 01/01/2015
Field of study

BACKGROUND: Plant phenotype datasets include many different types of data, formats, and terms from specialized vocabularies. Because these datasets were designed for different audiences, they frequently contain language and details tailored to investigators with different research objectives and backgrounds. Although phenotype comparisons across datasets have long been possible on a small scale, comprehensive queries and analyses that span a broad set of reference species, research disciplines, and knowledge domains continue to be severely limited by the absence of a common semantic framework. RESULTS: We developed a workflow to curate and standardize existing phenotype datasets for six plant species, encompassing both model species and crop plants with established genetic resources. Our effort focused on mutant phenotypes associated with genes of known sequence in Arabidopsis thaliana (L.) Heynh. (Arabidopsis), Zea mays L. subsp. mays (maize), Medicago truncatula Gaertn. (barrel medic or Medicago), Oryza sativa L. (rice), Glycine max (L.) Merr. (soybean), and Solanum lycopersicum L. (tomato). We applied the same ontologies, annotation standards, formats, and best practices across all six species, thereby ensuring that the shared dataset could be used for cross-species querying and semantic similarity analyses. Curated phenotypes were first converted into a common format using taxonomically broad ontologies such as the Plant Ontology, Gene Ontology, and Phenotype and Trait Ontology. We then compared ontology-based phenotypic descriptions with an existing classification system for plant phenotypes and evaluated our semantic similarity dataset for its ability to enhance predictions of gene families, protein functions, and shared metabolic pathways that underlie informative plant phenotypes. CONCLUSIONS: The use of ontologies, annotation standards, shared formats, and best practices for cross-taxon phenotype data analyses represents a novel approach to plant phenomics that enhances the utility of model genetic organisms and can be readily applied to species with fewer genetic resources and less well-characterized genomes. In addition, these tools should enhance future efforts to explore the relationships among phenotypic similarity, gene function, and sequence similarity in plants, and to make genotype-to-phenotype predictions relevant to plant biology, crop improvement, and potentially even human health.This item is part of the UA Faculty Publications collection. For more information this item or other items in the UA Campus Repository, contact the University of Arizona Libraries at [email protected]

Aberystwyth Research Portal

University of Birmingham Research Portal

Springer - Publisher Connector

PubMed Central

The University of Arizona

Recommended from our members

Ontologies as Integrative Tools for Plant Science

Author: Athreya Balaji
Cooper Laurel
Elser Justin
Gandolfo Maria A.
Jaiswal Pankaj
Mungall Christopher J.
Preece Justin
Rensing Stefan
Smith Barry
Stevenson Dennis W.
Walls Ramona L.
Publication venue: 'Botanical Society of America'
Publication date
Field of study

Premise of the study: Bio-ontologies are essential tools for accessing and analyzing the rapidly growing pool of plant genomic and phenomic data. Ontologies provide structured vocabularies to support consistent aggregation of data and a semantic framework for automated analyses and reasoning. They are a key component of the semantic web. Methods: This paper provides background on what bio-ontologies are, why they are relevant to botany, and the principles of ontology development. It includes an overview of ontologies and related resources that are relevant to plant science, with a detailed description of the Plant Ontology (PO). We discuss the challenges of building an ontology that covers all green plants (Viridiplantae). Key results: Ontologies can advance plant science in four keys areas: (1) comparative genetics, genomics, phenomics, and development; (2) taxonomy and systematics; (3) semantic applications; and (4) education. Conclusions: Bio-ontologies offer a flexible framework for comparative plant biology, based on common botanical understanding. As genomic and phenomic data become available for more species, we anticipate that the annotation of data with ontology terms will become less centralized, while at the same time, the need for cross-species queries will become more common, causing more researchers in plant science to turn to ontologies.Keywords: Bio-ontologies, Plant Ontology, Plant genomics, OBO Foundry, Plant systematics, Plant anatomy, Genome annotation, Semantic web, PhenomicsKeywords: Bio-ontologies, Plant Ontology, Plant genomics, OBO Foundry, Plant systematics, Plant anatomy, Genome annotation, Semantic web, Phenomic

ScholarsArchive@OSU

Recommended from our members

An Extension of the Plant Ontology Project Supporting Wood Anatomy and Development Research

Author: Cooper Laurel
Gandolfo Maria Alejandra
Groover Andrew
Jaiswal Pankaj
Lachenbruch Barbara
Lens Frederic
Spicer Rachel
Staton Margaret E.
Stevenson Dennis W.
Walls Ramona L.
Wegrzyn Jill
Publication venue: International Association of Wood Anatomists
Publication date
Field of study

A wealth of information on plant anatomy and morphology is available in the current and historical literature, and molecular biologists are producing massive amounts of transcriptome and genome data that can be used to gain better insights into the development, evolution, ecology, and physiological function of plant anatomical attributes. Integrating anatomical and molecular data sets is of major importance to the field of wood science, but this is often hampered by the lack of a standardized, controlled vocabulary that allows for cross-referencing among disparate data types. One approach to overcome this obstacle is through the annotation of data using a common controlled vocabulary or “ontology” (Ashburner et al. 2000; Smith et al. 2007). An ontology is a way of representing knowledge in a given domain that includes a set of terms to describe the classes in that domain, as well as the relationships among terms. Each term can be associated with an array of data such as names, definitions, identification numbers, and genes involved. Ontologies are fundamental for unifying diverse terminologies and are increasingly used by scientists, philosophers, the military and online web search engines. In an ontology, terms are carefully defined, allowing a wide array of researchers to (1) use terms consistently in scientific publications or standardized handbooks on quality/trait evaluations, and (2) search for and integrate data linked to these terms in anatomical, genetic, genomic, and other types of biological databases.This is the publisher’s final pdf. The published article is copyrighted by the International Association of Wood Anatomists and can be found at: http://science.naturalis.nl/media/337871/lens%20et%20al-2012%20iawa%20j%20plant%20ontology.pdf)

ScholarsArchive@OSU

Recommended from our members

Open Science principles for accelerating trait-based science across the Tree of Life.

Author: Adams Vanessa M
Alroy John
Andrew Samuel C
Ankenbrand Markus J
Balk Meghan A
Bland Lucie M
Boyle Brad L
Bravo-Avila Catherine H
Brennan Ian
Carthey Alexandra JR
Catullo Renee
Cavazos Brittany R
Chown Steven L
Conde Dalia A
Enquist Brian J
Fadrique Belen
Falster Daniel S
Feng Xiao
Gallagher Rachael V
Gibb Heloise
Halbritter Aud H
Hammock Jennifer
Hogan J Aaron
Holewa Hamish
Hope Michael
Iversen Colleen M
Jochum Malte
Kattge Jens
Kearney Michael
Keller Alexander
Mabee Paula
Madin Joshua S
Maitner Brian S
Manning Peter
McCormack Luke
Michaletz Sean T
Park Daniel S
Pearse William D
Penone Caterina
Perez Timothy M
Pineda-Munoz Silvia
Poelen Jorrit H
Ray Courtenay A
Rossetto Maurizio
Salguero-Gómez Roberto
Sauquet Hervé
Schneider Florian D
Sparrow Benjamin
Spasojevic Marko J
Telford Richard J
Tobias Joseph A
Vandvik Vigdis
Violle Cyrille
Walls Ramona
Weiss Katherine CB
Westoby Mark
Wright Ian J
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

Synthesizing trait observations and knowledge across the Tree of Life remains a grand challenge for biodiversity science. Species traits are widely used in ecological and evolutionary science, and new data and methods have proliferated rapidly. Yet accessing and integrating disparate data sources remains a considerable challenge, slowing progress toward a global synthesis to integrate trait data across organisms. Trait science needs a vision for achieving global integration across all organisms. Here, we outline how the adoption of key Open Science principles-open data, open source and open methods-is transforming trait science, increasing transparency, democratizing access and accelerating global synthesis. To enhance widespread adoption of these principles, we introduce the Open Traits Network (OTN), a global, decentralized community welcoming all researchers and institutions pursuing the collaborative goal of standardizing and integrating trait data across organisms. We demonstrate how adherence to Open Science principles is key to the OTN community and outline five activities that can accelerate the synthesis of trait data across the Tree of Life, thereby facilitating rapid advances to address scientific inquiries and environmental issues. Lessons learned along the path to a global synthesis of trait data will provide a framework for addressing similarly complex data science and informatics challenges

eScholarship - University of California

Oxford University Research Archive

Western Sydney ResearchDirect

Bern Open Repository and Information System (BORIS)

Monash University Research Portal

The Monarch Initiative in 2024: an analytic platform integrating phenotypes, genes and diseases across species.

Author: Alquaddoomi Faisal S
Braun Ian
Bruskiewich Richard M
Cappelletti Luca
Carbon Seth
Caron Anita R
Caufield J Harry
Chan Lauren E
Chute Christopher G
Cortes Katherina G
Cox Corey
De Souza Vinícius
Elsarboukh Glass
Fontana Tommaso
Gehrke Sarah
Haendel Melissa A
Harris Nomi L
Hartley Emily L
Hegde Harshad
Hurwitz Eric
Jacobsen Julius O B
Krishnamurthy Madan
Laraway Bryan J
Matentzoglu Nicolas
McLaughlin James A
McMurry Julie A
Moxon Sierra A T
Mullen Kathleen R
Mungall Christopher J
Munoz-Torres Monica C
O\u27Neil Shawn T
Osumi-Sutherland David
Putman Tim E
Reese Justin T
Robinson Peter N
Rubinetti Vincent P
Schaper Kevin
Shefchek Kent A
Smedley Damian
Stefancsik Ray
Toro Sabrina
Vasilevsky Nicole A
Walls Ramona L
Whetzel Patricia L
Publication venue: The Mouseion at the JAXlibrary
Publication date: 05/01/2024
Field of study

Bridging the gap between genetic variations, environmental determinants, and phenotypic outcomes is critical for supporting clinical diagnosis and understanding mechanisms of diseases. It requires integrating open data at a global scale. The Monarch Initiative advances these goals by developing open ontologies, semantic data models, and knowledge graphs for translational research. The Monarch App is an integrated platform combining data about genes, phenotypes, and diseases across species. Monarch\u27s APIs enable access to carefully curated datasets and advanced analysis tools that support the understanding and diagnosis of disease for diverse applications such as variant prioritization, deep phenotyping, and patient profile-matching. We have migrated our system into a scalable, cloud-based infrastructure; simplified Monarch\u27s data ingestion and knowledge graph integration systems; enhanced data mapping and integration standards; and developed a new user interface with novel search and graph navigation features. Furthermore, we advanced Monarch\u27s analytic tools by developing a customized plugin for OpenAI\u27s ChatGPT to increase the reliability of its responses about phenotypic data, allowing us to interrogate the knowledge in the Monarch graph using state-of-the-art Large Language Models. The resources of the Monarch Initiative can be found at monarchinitiative.org and its corresponding code repository at github.com/monarch-initiative/monarch-app

The Jackson Laboratory: The Mouseion at the JAXlibrary